Your Transformer is Secretly an EOT Solver
elonlit.com · 22h · Discuss: Hacker News
📊 Shannon Entropy
Beyond the Black Box: Making LLM Decoding Truly End-to-End
dev.to · 10h · Discuss: DEV
✂️ Tokenization
Empirical Bayesian Multi-Bandit Learning
arxiv.org · 23h
📊 Bayesian Inference
Maths behind ML Algorithms (Bayesian Decision Theory)
pub.towardsai.net · 2d
📊 Bayesian Inference
GIR-Bench: Versatile Benchmark for Generating Images with Reasoning
paperium.net · 11h · Discuss: DEV
🗂️ Obsidian
From Lossy to Lossless Reasoning
manidoraisamy.com · 9h · Discuss: Hacker News
✂️ Tokenization
Algorithmic Randomness, Exchangeability, and the Principal Principle
arxiv.org · 2d
📊 Shannon Entropy
My ML Learning Journey: From Confusion to Building a Working Model
kaggle.com · 21h · Discuss: DEV
🤖 Machine Learning
Building a Rules Engine from First Principles
towardsdatascience.com · 1d
📊 Shannon Entropy
Minimal Sufficiency: A Principle ‘Similar’ to End-to-End
cacm.acm.org · 8h · Discuss: Hacker News
🌐 Federated Systems
Optimal Information Combining for Multi-Agent Systems Using Adaptive Bias Learning
arxiv.org · 23h
🔗 Mutual Information
Stochastic computing
scottlocklin.wordpress.com · 10h
🔗 Mutual Information
How Do We Evaluate the Quality of LLMs' Mathematical Responses?
lesswrong.com · 2d
📈 Search Quality
Understanding Hardness of Vision-Language Compositionality from A Token-level Causal Lens
arxiv.org · 23h
✂️ Tokenization
StreetMath: Study of LLMs' Approximation Behaviors
arxiv.org · 23h
🤖 Local LLMs
How fast can an LLM go?
fergusfinn.com · 1d · Discuss: Hacker News
🤖 Local LLMs
Bayesian continual learning and forgetting in neural networks
nature.com · 2d
📊 Bayesian Inference
Context-Bench: Benchmarking LLMs on Agentic Context Engineering
letta.com · 8h · Discuss: Hacker News
💬 Natural Language Processing
An underqualified reading list about the transformer architecture
fvictorio.github.io · 1d · Discuss: Hacker News
💬 Natural Language Processing
Anthropic Research Shows How LLMs Perceive Text
searchenginejournal.com · 1d
✂️ Tokenization